Robust Feature Extraction for Bimodal Speech Recognizer
نویسنده
چکیده
منابع مشابه
Enhancing Robustness of Speech Recognizers by Bimodal Features
In this paper a robust speech recognizer is presented based on features obtained from the speech signal and also from the image of the speaker. The features were combined by simple concatenation, resulting composed feature vectors to train the models corresponding to each class. For recognition, the classification process relies on a very effective algorithm, namely the multiclass SVM. Under ad...
متن کاملTitle: Noise robust LVCSR feature extraction based on stabilized weighted linear prediction
In this paper, we evaluate a recently proposed spectral envelope estimation method, stabilized weighted linear prediction (SWLP), in the feature extraction stage of a large vocabulary continuous speech recognizer (LVCSR) system. Using speech recorded in real-world noisy environments, we compare recognition error rates obtained with SWLP to those given by the conventional spectrum estimation met...
متن کاملChallenges in the Fusion of Video and Audio for Robust Speech Recognition
As speech recognizers become more robust, they are popularly accepted as an essential component of human-computer interaction. State-ofthe-art speaker-independent speech recognizers exist with word recognition error rates below 10%. To achieve even higher and robust recognition performance, multi-modal speech recognition techniques that combine video and audio information call be used. Speech r...
متن کاملPerformance Evaluation of Blind Equalization for Mel-LPC based Speech Recognition under Different Noisy Conditions
This study is aimed to develop a noise robust distributed speech recognizer (DSR) for real-world applications by employing Blind Equalization (BEQ) for robust feature extraction. The main focus of the work is to cope with different noisy environments in recognition phase. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by appl...
متن کاملPerformance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments
This study is intended to develop a noise robust distributed speech recognizer for real-world applications by employing Cepstral Mean Normalization (CMN) for robust feature extraction. The main focus of the work is to cope with different noisy environments. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by applying a first-or...
متن کامل